14:00
2026-06-06
arxiv.org
large-language-models
Benchmarks in Leipzig
A group of 49 mathematicians compiled a dataset of 100 research-level mathematics questions with known answers during a workshop at the Max Planck Institute for Mathematics in the Sciences in Leipzig,โฆ